How can we learn efficiently to act optimally and flexibly?

نویسنده

  • Kenji Doya
چکیده

W hen we walk to a shop in a town, we want to get there in the shortest time. However, finding the shortest route in a big city is quite tricky, because there are countless possible routes and the time taken for each segment of a route is uncertain. This is a typical problem of discrete optimal control, which aims to find the optimal sequence of actions to minimize the total cost from any given state to the goal state. The problems of optimal control are ubiquitous, from animal foraging to national economic policy, and there have been lots of theoretical studies on the topic. However, solving an optimal control problem requires a huge amount of computations except for limited cases. In this issue of PNAS, Emanuel Todorov (1) presents a refreshingly new approach in optimal control based on a novel insight as to the duality of optimal control and statistical inference. The standard strategy in optimal control is to identify the ‘‘cost-to-go’’ function for each state, such as how much time you need from a street corner to your office. If such a cost-to-go function is available for all of the states, we can find the best route by simply following the nearest state with the lowest cost-to-go. More specifically, we use the formulation

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

W7: Acceptance and Commitment Therapy (ACT)

Acceptance and Commitment Therapy (ACT) is a new development within behavioral therapy. Its goal is to promote psychological flexibility. Numerous studies show that suffering results when people attempt to avoid their inner experiences (experiential avoidance). This experiential avoidance often leads to rigid and inflexible behavior that also leads them to compromise their goals and what is imp...

متن کامل

O1: Defining Talent: A Cultural Perspective

What is talent? How can it be identified? Who is responsible for identifying it? Are there universally valued talents, or are they all culturally bound? There are at least three different levels of analysis to explore these questions. On the government level, we must philosophically decide on how our country chooses and expresses its values through what is taught, to whom, and for what periods ...

متن کامل

مدیر موفق کیست؟

Who is a really successful manager? A manager who spends less money, or the one who earns more? A manager who can survive for a longer period of time, or an administrator who expands his organization, and opens up new branches? Which one is the most successful? The article tries to answer these questions and provides, some simple guidlines for the managers in every domain of management who wan...

متن کامل

On the Path to UHC – Global Evidence Must Go Local to Be Useful; Comment on “Disease Control Priorities Third Edition Is Published: A Theory of Change Is Needed for Translating Evidence to Health Policy”

The Disease Control Priorities (DCP) publications have pioneered new ways of thinking about investing in health. We agree with Norheim, that a useful first step to advance efforts to translate DCP’s global evidence into local health priorities, is to develop a clear Theory of Change (ToC). However, a ToC that aims to define how global evidence (DCP and others) can be used to inform national pol...

متن کامل

How Can a Global Social Support System Hope to Achieve Fairer Competiveness?; Comment on “A Global Social Support System: What the International Community Could Learn From the United States’ National Basketball Association”

Ooms et al sets out some good general principles for a global social support system to improve fairer global competitiveness as a result of redistribution. This commentary sets out to summarize some of the conditions that would need to be satisfied for it to level up gradients in inequality through such a social support system, using the National Basketball Association (NBA) example as a point ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 106 28  شماره 

صفحات  -

تاریخ انتشار 2009